Goto

Collaborating Authors

 machine learning application


I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey

Lewis, Noah, Bez, Jean Luca, Byna, Suren

arXiv.org Artificial Intelligence

Because of the increased popularity of Machine Learning (ML) workloads, there is a rising demand for I/O systems that can effectively accommodate their distinct I/O access patterns. Write operation bursts commonly dominate traditional workloads; however, ML workloads are usually read-intensive and use many small files [99]. Due to the absence of a well-established consensus on the preferred I/O stack for ML workloads, numerous developers resort to crafting their own ad-hoc algorithms and storage systems to cater to the specific requirements of their applications [50]. This can result in sub-optimal application performance due to the under-utilization of the storage system, prompting the necessity for novel I/O optimization methods tailored to the demands of ML workloads. In Figure 1, we show the evolving I/O stack used for running ML workloads (on the right side) in comparison with the traditional HPC I/O stack (on the left side). Traditional HPC I/O stack has been developed to support massive parallelism. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page.


Vulnerability Clustering and other Machine Learning Applications of Semantic Vulnerability Embeddings

Stehr, Mark-Oliver, Kim, Minyoung

arXiv.org Artificial Intelligence

Cyber-security vulnerabilities are usually published in form of short natural language descriptions (e.g., in form of MITRE's CVE list) that over time are further manually enriched with labels such as those defined by the Common Vulnerability Scoring System (CVSS). In the Vulnerability AI (Analytics and Intelligence) project, we investigated different types of semantic vulnerability embeddings based on natural language processing (NLP) techniques to obtain a concise representation of the vulnerability space. We also evaluated their use as a foundation for machine learning applications that can support cyber-security researchers and analysts in risk assessment and other related activities. The particular applications we explored and briefly summarize in this report are clustering, classification, and visualization, as well as a new logic-based approach to evaluate theories about the vulnerability space.


Product Manager, Machine Learning Applications at Schrödinger - New York

#artificialintelligence

As a member of the Machine Learning team, you'll work with both methods researchers and small molecule designers to imagine and design user experiences to leverage machine learning methods. This position offers the opportunity to influence Schrödinger's business direction and scientific functionality by bridging gaps between technical, scientific, and commercial realms.


Top 10 Companies Using Machine Learning Application

#artificialintelligence

Machine learning and artificial intelligence are two of the most important technological advances in recent times. Machine learning is a field that promises to disrupt (to borrow a favorite term) our lives as we know them. However, many of its applications are still unexplored. See the example of machine learning in action! These are 10 companies using machine learning in exciting new ways. It's easier these days to list areas in scientific R&D that Google (or, rather, parent company Alphabet-) isn't working upon, than to try to summarize Google's technological ambitions.


10 Things You Should Know As A Data Scientist

#artificialintelligence

If you are a data scientist or want to become one, there are certain things you should know. This blog post will discuss 10 of the most important ones. We will cover various topics, including machine learning, big data, and more. So whether you are just starting your data science career or looking to expand your knowledge base, read on for some valuable information! A data scientist analyzes and interprets data to find trends or patterns.


Researchers at Peking University Open-Source 'CircuitNet,' a Dataset for Machine Learning Applications in Electronic Design Automation (EDA)

#artificialintelligence

Electronic design automation (EDA), often known as computer-aided design (CAD), is a class of software tools used to create electronic systems like integrated circuits (ICs). EDA tools enable designers to create a design for large-scale integrated chips (VLSI) with billions of transistors. Due to the size and complexity of current electronic systems, EDA tools are crucial for VLSI design. The EDA research community has recently been actively investigating AI for IC methodologies to design cutting-edge chips, thanks to the explosion of artificial intelligence (AI) algorithms. Numerous studies have investigated machine learning-based solutions for cross-stage prediction tasks in the design cycle to promote speedier design convergence.


CircuitNet: An Open-Source Dataset for Machine Learning Applications in Electronic Design Automation (EDA)

Chai, Zhuomin, Zhao, Yuxiang, Lin, Yibo, Liu, Wei, Wang, Runsheng, Huang, Ru

arXiv.org Artificial Intelligence

The electronic design automation (EDA) community has been actively exploring machine learning (ML) for very large-scale integrated computer-aided design (VLSI CAD). Many studies explored learning-based techniques for cross-stage prediction tasks in the design flow to achieve faster design convergence. Although building ML models usually requires a large amount of data, most studies can only generate small internal datasets for validation because of the lack of large public datasets. In this essay, we present the first open-source dataset called CircuitNet for ML tasks in VLSI CAD.


15 Projects on Machine Learning Applications in Finance

#artificialintelligence

The finance and banking industry generates enormous amounts of data related to transactions, billing, and payments from customers, which can provide accurate insights and predictions to be fed to machine learning models. The huge volumes of transaction data have helped the finance industry streamline processes, reduce investment risks, and optimize investment portfolios for clients and companies. There is a wide range of open-source machine learning algorithms and tools that fit exceptionally with financial data. Also, financial services and banking companies have substantial funds that they can afford to spend on state-of-the-art computing hardware needed for the machine learning architecture. With the quantitative nature of the finance sector and large volumes of historical data available, machine learning in finance is poised to enhance several aspects of the industry.


6 Benefits of Using MLOps For Your Machine Learning Application

#artificialintelligence

Machine learning Operations (MLOps) is a process focused on taking machine learning models into production. It is a collaborative function that consists of data scientists, DevOps engineers, etc. The ML model goes through the development, integration, testing, deployment, and monitoring steps as DevOps. This automates the deployment of machine learning and deep learning models in massive production systems, streamlining the management process. Aligning models with both business demands and regulatory standards is simpler.


Machine Learning Applications for Supply Chain Planning

#artificialintelligence

Participants in GTPE courses are required to complete an online profile that meets the requirements of Georgia Tech Research Security. Information collected is maintained in the Georgia Tech Student System. This data is not published in Georgia Tech's online directory system and therefore is not currently available to the general public. Learner information is used only as described in our Privacy Policy. GTPE data is not sold or provided to external entities.